Voice Recognition using Dynamic Time Warping and Mel-Frequency Cepstral Coefficients Algorithms
Authors
Abstract
Similar resources
Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques
Digital processing of speech signals and voice recognition algorithms are very important for fast and accurate automatic voice recognition technology. The voice is a signal carrying infinite information. Direct analysis and synthesis of the complex voice signal is difficult because of the amount of information contained in the signal. Therefore, digital signal processes such as Feature Extraction and Feature Matching...
Support Vector Machines and Mel-Frequency Cepstral Coefficients: an Application for Automatic Voice Recognition
The speech recognition problem can be modeled as a classification problem, where one wants to get the best degree of separability between classes representing the voice. In order to apply that concept to build an automated speech recognition system capable of identifying the speaker, many techniques using artificial intelligence and general classification have been developed, which lead to this...
The Capacity of Mel Frequency Cepstral Coefficients for Speech Recognition
Speech recognition makes an important contribution to promoting new technologies in human-computer interaction. Today, there is a growing need to employ speech technology in daily life and business activities. However, speech recognition is a challenging task that requires several stages before the desired output is obtained. Among automatic speech recognition (ASR) components is the feature ex...
Mel Frequency Cepstral Coefficients for Music Modeling
We examine in some detail Mel Frequency Cepstral Coefficients (MFCCs), the dominant features used for speech recognition, and investigate their applicability to modeling music. In particular, we examine two of the main assumptions of the process of forming MFCCs: the use of the Mel frequency scale to model the spectra, and the use of the Discrete Cosine Transform (DCT) to decorrelate the Mel-spec...
Multiple Time Resolutions for Derivatives of Mel-frequency Cepstral Coefficients
Most speech recognition systems are based on mel-frequency cepstral coefficients and their first- and second-order derivatives. The derivatives are normally approximated by fitting a linear regression line to a fixed-length segment of consecutive frames. The time resolution and smoothness of the estimated derivative depend on the length of the segment. We present an approach to improve the represe...
Journal
Journal title: International Journal of Computer Applications
Year: 2015
ISSN: 0975-8887
DOI: 10.5120/20312-2362